Overview

Dataset statistics

Number of variables: 19
Number of observations: 1025010
Missing cells: 4912237
Missing cells (%): 25.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 148.6 MiB
Average record size in memory: 152.0 B
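
The derived figures above can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is the number of missing cells divided by (variables x observations), and that the average record size is the total memory divided by the number of observations. The report does not state these formulas; they are assumptions that happen to reproduce the reported 25.2% and 152.0 B. The function names are hypothetical.

```python
# Cross-check of the derived dataset statistics (formulas are assumptions).
N_VARIABLES = 19
N_OBSERVATIONS = 1025010
MISSING_CELLS = 4912237
TOTAL_MEMORY_MIB = 148.6


def missing_cell_share(missing: int, n_vars: int, n_obs: int) -> float:
    """Fraction of all cells (variables x observations) that are missing."""
    return missing / (n_vars * n_obs)


def average_record_size_bytes(total_mib: float, n_obs: int) -> float:
    """Average bytes per row, given total memory in MiB (1 MiB = 1024**2 B)."""
    return total_mib * 1024**2 / n_obs


share = missing_cell_share(MISSING_CELLS, N_VARIABLES, N_OBSERVATIONS)
size = average_record_size_bytes(TOTAL_MEMORY_MIB, N_OBSERVATIONS)
print(f"Missing cells (%): {share:.1%}")
print(f"Average record size: {size:.1f} B")
```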

Variable types

DateTime: 1
Categorical: 14
Boolean: 2
Numeric: 1
Unsupported: 1

Alerts

Sub-product has a high cardinality: 75 distinct values [High cardinality]
Issue has a high cardinality: 166 distinct values [High cardinality]
Sub-issue has a high cardinality: 218 distinct values [High cardinality]
Consumer Complaint has a high cardinality: 268391 distinct values [High cardinality]
Company has a high cardinality: 4780 distinct values [High cardinality]
State has a high cardinality: 63 distinct values [High cardinality]
ZIP code has a high cardinality: 28944 distinct values [High cardinality]
Date Sent to Company has a high cardinality: 2292 distinct values [High cardinality]
Product is highly overall correlated with Sub-product [High correlation]
Sub-product is highly overall correlated with Product [High correlation]
Consumer consent provided? is highly overall correlated with Submitted via [High correlation]
Submitted via is highly overall correlated with Consumer consent provided? [High correlation]
Company Public Response is highly imbalanced (50.1%) [Imbalance]
Company Response to Consumer is highly imbalanced (58.9%) [Imbalance]
Timely response? is highly imbalanced (82.0%) [Imbalance]
Sub-product has 235170 (22.9%) missing values [Missing]
Sub-issue has 496157 (48.4%) missing values [Missing]
Consumer Complaint has 747196 (72.9%) missing values [Missing]
Company Public Response has 706646 (68.9%) missing values [Missing]
State has 12360 (1.2%) missing values [Missing]
ZIP code has 16718 (1.6%) missing values [Missing]
Tags has 883422 (86.2%) missing values [Missing]
Consumer consent provided? has 533099 (52.0%) missing values [Missing]
Consumer disputed? has 256456 (25.0%) missing values [Missing]
Unnamed: 18 has 1025010 (100.0%) missing values [Missing]
Complaint ID has unique values [Unique]
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]

Reproduction

Analysis started: 2023-02-01 03:14:53.867795
Analysis finished: 2023-02-01 03:19:19.729165
Duration: 4 minutes and 25.86 seconds
Software version: pandas-profiling v0.0.dev0
Download configuration: config.json

Variables

Distinct: 2343
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 7.8 MiB
Minimum: 2011-12-01 00:00:00
Maximum: 2018-05-01 00:00:00
Histogram with fixed size bins (bins=50)
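
A histogram with fixed-size bins divides the span between the minimum and maximum into equally wide intervals. The following minimal sketch uses only the Python standard library with the reported range and bins=50. The `bin_index` helper and the sample dates are hypothetical and not taken from the report or from pandas-profiling.

```python
from collections import Counter
from datetime import datetime

MINIMUM = datetime(2011, 12, 1)
MAXIMUM = datetime(2018, 5, 1)
BINS = 50


def bin_index(value: datetime, lo: datetime = MINIMUM,
              hi: datetime = MAXIMUM, bins: int = BINS) -> int:
    """Return the 0-based index of the equal-width bin containing value."""
    if not lo <= value <= hi:
        raise ValueError("value outside the histogram range")
    width = (hi - lo) / bins  # timedelta: width of one bin
    # The maximum sits on the right edge, so clamp it into the last bin.
    return min(int((value - lo) / width), bins - 1)


# Hypothetical sample dates, for illustration only.
sample = [datetime(2011, 12, 1), datetime(2015, 2, 14), datetime(2018, 5, 1)]
histogram = Counter(bin_index(d) for d in sample)
bin_width_days = (MAXIMUM - MINIMUM).days / BINS
print(histogram, bin_width_days)
```

With the reported range, each of the 50 bins covers a little under 47 days.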

Product
Categorical

Distinct: 18
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.8 MiB

Value | Count
Mortgage | 254165
Debt collection | 196212
Credit reporting | 140433
Credit reporting, credit repair services, or other personal consumer reports | 110756
Credit card | 89191
Other values (13) | 234253

Length

Max length: 76
Median length: 41
Mean length: 20.937055
Min length: 8

Characters and Unicode

Total characters: 21460691
Distinct characters: 32
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
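
As an illustration of such character properties, the sketch below counts characters per Unicode general category using the standard-library `unicodedata` module. It is not the report's own implementation; the `category_counts` helper and the `CATEGORY_NAMES` mapping are hypothetical, and the three input strings are Product values taken from this report.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for char in value:
            code = unicodedata.category(char)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


values = [
    "Mortgage",
    "Debt collection",
    "Credit reporting, credit repair services, or other personal consumer reports",
]
counts = category_counts(values)
for name, n in counts.most_common():
    print(name, n)
```

For these values the sketch finds four categories, in line with the "Distinct categories" figure reported for Product.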

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Mortgage
2nd row: Credit reporting
3rd row: Consumer Loan
4th row: Credit card
5th row: Debt collection

Common Values

Value | Count | Frequency (%)
Mortgage | 254165 | 24.8%
Debt collection | 196212 | 19.1%
Credit reporting | 140433 | 13.7%
Credit reporting, credit repair services, or other personal consumer reports | 110756 | 10.8%
Credit card | 89191 | 8.7%
Bank account or service | 86206 | 8.4%
Student loan | 42969 | 4.2%
Consumer Loan | 31606 | 3.1%
Credit card or prepaid card | 22913 | 2.2%
Checking or savings account | 18982 | 1.9%
Other values (8) | 31577 | 3.1%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
credit | 474049 | 15.7%
or | 254637 | 8.4%
mortgage | 254165 | 8.4%
reporting | 251189 | 8.3%
debt | 196212 | 6.5%
collection | 196212 | 6.5%
consumer | 142362 | 4.7%
card | 138836 | 4.6%
personal | 115123 | 3.8%
other | 111816 | 3.7%
Other values (21) | 889493 | 29.4%
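
The word counts above appear to come from lowercasing each value, splitting on whitespace, and trimming punctuation from the ends of each token. That rule is an inference from the numbers, not a confirmed description of pandas-profiling. The sketch below applies it to the Product values from Common Values that contain "credit"; `word_counts` is a hypothetical helper.

```python
import string
from collections import Counter

# Product values containing the word "credit", with counts from Common Values.
PRODUCT_COUNTS = {
    "Credit reporting": 140433,
    "Credit reporting, credit repair services, or other personal consumer reports": 110756,
    "Credit card": 89191,
    "Credit card or prepaid card": 22913,
}


def word_counts(value_counts):
    """Weighted word frequencies: lowercase, split on whitespace, trim punctuation."""
    words = Counter()
    for value, count in value_counts.items():
        for token in value.lower().split():
            word = token.strip(string.punctuation)
            if word:
                words[word] += count
    return words


words = word_counts(PRODUCT_COUNTS)
print(words["credit"], words["reporting"])
```

Under this assumed rule, these four values alone account for the reported counts of "credit" (474049) and "reporting" (251189). Other words, such as "card", also occur in values not listed here, so their totals from this subset fall short of the reported figures.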

Most occurring characters

Value | Count | Frequency (%)
r | 2706616 | 12.6%
e | 2519682 | 11.7%
(space) | 1999084 | 9.3%
o | 1852374 | 8.6%
t | 1811201 | 8.4%
i | 1318627 | 6.1%
c | 1204231 | 5.6%
n | 1112049 | 5.2%
a | 999354 | 4.7%
g | 797483 | 3.7%
Other values (22) | 5139990 | 24.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 18163175 | 84.6%
Space Separator | 1999084 | 9.3%
Uppercase Letter | 1056616 | 4.9%
Other Punctuation | 241816 | 1.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
r | 2706616 | 14.9%
e | 2519682 | 13.9%
o | 1852374 | 10.2%
t | 1811201 | 10.0%
i | 1318627 | 7.3%
c | 1204231 | 6.6%
n | 1112049 | 6.1%
a | 999354 | 5.5%
g | 797483 | 4.4%
s | 742889 | 4.1%
Other values (11) | 3098669 | 17.1%

Uppercase Letter
Value | Count | Frequency (%)
C | 413881 | 39.2%
M | 265304 | 25.1%
D | 196212 | 18.6%
B | 86206 | 8.2%
S | 42969 | 4.1%
L | 31606 | 3.0%
P | 13732 | 1.3%
V | 5646 | 0.5%
O | 1060 | 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 1999084 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 241816 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 19219791 | 89.6%
Common | 2240900 | 10.4%

Most frequent character per script

Latin
Value | Count | Frequency (%)
r | 2706616 | 14.1%
e | 2519682 | 13.1%
o | 1852374 | 9.6%
t | 1811201 | 9.4%
i | 1318627 | 6.9%
c | 1204231 | 6.3%
n | 1112049 | 5.8%
a | 999354 | 5.2%
g | 797483 | 4.1%
s | 742889 | 3.9%
Other values (20) | 4155285 | 21.6%

Common
Value | Count | Frequency (%)
(space) | 1999084 | 89.2%
, | 241816 | 10.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 21460691 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
r | 2706616 | 12.6%
e | 2519682 | 11.7%
(space) | 1999084 | 9.3%
o | 1852374 | 8.6%
t | 1811201 | 8.4%
i | 1318627 | 6.1%
c | 1204231 | 5.6%
n | 1112049 | 5.2%
a | 999354 | 4.7%
g | 797483 | 3.7%
Other values (22) | 5139990 | 24.0%

Sub-product
Categorical

HIGH CARDINALITY  HIGH CORRELATION  MISSING 

Distinct: 75
Distinct (%): < 0.1%
Missing: 235170
Missing (%): 22.9%
Memory size: 7.8 MiB

Value | Count
Credit reporting | 108469
Other mortgage | 86636
Checking account | 73186
Conventional fixed mortgage | 70616
Other (i.e. phone, health club, etc.) | 44561
Other values (70) | 406372

Length

Max length: 42
Median length: 35
Mean length: 19.676088
Min length: 4

Characters and Unicode

Total characters: 15540961
Distinct characters: 53
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Other mortgage
2nd row: Vehicle loan
3rd row: Credit card
4th row: Conventional adjustable mortgage (ARM)
5th row: Medical

Common Values

Value | Count | Frequency (%)
Credit reporting | 108469 | 10.6%
Other mortgage | 86636 | 8.5%
Checking account | 73186 | 7.1%
Conventional fixed mortgage | 70616 | 6.9%
Other (i.e. phone, health club, etc.) | 44561 | 4.3%
I do not know | 39828 | 3.9%
Credit card | 28700 | 2.8%
FHA mortgage | 28255 | 2.8%
Conventional adjustable mortgage (ARM) | 25381 | 2.5%
Non-federal student loan | 25166 | 2.5%
Other values (65) | 259042 | 25.3%
(Missing) | 235170 | 22.9%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
mortgage | 246831 | 11.0%
credit | 184812 | 8.3%
other | 175929 | 7.9%
conventional | 110860 | 4.9%
reporting | 108469 | 4.8%
loan | 108236 | 4.8%
card | 81962 | 3.7%
account | 80304 | 3.6%
checking | 73186 | 3.3%
fixed | 70616 | 3.2%
Other values (93) | 998426 | 44.6%

Most occurring characters

Value | Count | Frequency (%)
e | 1721330 | 11.1%
(space) | 1449791 | 9.3%
t | 1321353 | 8.5%
r | 1160037 | 7.5%
o | 1127744 | 7.3%
n | 1048287 | 6.7%
a | 938073 | 6.0%
i | 772549 | 5.0%
g | 721499 | 4.6%
d | 606042 | 3.9%
Other values (43) | 4674256 | 30.1%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 12691192 | 81.7%
Space Separator | 1449791 | 9.3%
Uppercase Letter | 955332 | 6.1%
Other Punctuation | 240956 | 1.6%
Close Punctuation | 79949 | 0.5%
Open Punctuation | 79949 | 0.5%
Dash Punctuation | 43616 | 0.3%
Final Punctuation | 176 | < 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 1721330 | 13.6%
t | 1321353 | 10.4%
r | 1160037 | 9.1%
o | 1127744 | 8.9%
n | 1048287 | 8.3%
a | 938073 | 7.4%
i | 772549 | 6.1%
g | 721499 | 5.7%
d | 606042 | 4.8%
c | 587627 | 4.6%
Other values (15) | 2686651 | 21.2%

Uppercase Letter
Value | Count | Frequency (%)
C | 341840 | 35.8%
O | 177780 | 18.6%
A | 65283 | 6.8%
M | 62870 | 6.6%
I | 54157 | 5.7%
F | 48432 | 5.1%
H | 43582 | 4.6%
V | 28498 | 3.0%
R | 27996 | 2.9%
N | 25166 | 2.6%
Other values (9) | 79728 | 8.3%

Other Punctuation
Value | Count | Frequency (%)
. | 133683 | 55.5%
, | 89122 | 37.0%
/ | 17925 | 7.4%
' | 226 | 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 1449791 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 79949 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 79949 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 43616 | 100.0%

Final Punctuation
Value | Count | Frequency (%)
(character not shown) | 176 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 13646524 | 87.8%
Common | 1894437 | 12.2%

Most frequent character per script

Latin
Value | Count | Frequency (%)
e | 1721330 | 12.6%
t | 1321353 | 9.7%
r | 1160037 | 8.5%
o | 1127744 | 8.3%
n | 1048287 | 7.7%
a | 938073 | 6.9%
i | 772549 | 5.7%
g | 721499 | 5.3%
d | 606042 | 4.4%
c | 587627 | 4.3%
Other values (34) | 3641983 | 26.7%

Common
Value | Count | Frequency (%)
(space) | 1449791 | 76.5%
. | 133683 | 7.1%
, | 89122 | 4.7%
) | 79949 | 4.2%
( | 79949 | 4.2%
- | 43616 | 2.3%
/ | 17925 | 0.9%
' | 226 | < 0.1%
(character not shown) | 176 | < 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 15540785 | > 99.9%
Punctuation | 176 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
e | 1721330 | 11.1%
(space) | 1449791 | 9.3%
t | 1321353 | 8.5%
r | 1160037 | 7.5%
o | 1127744 | 7.3%
n | 1048287 | 6.7%
a | 938073 | 6.0%
i | 772549 | 5.0%
g | 721499 | 4.6%
d | 606042 | 3.9%
Other values (42) | 4674080 | 30.1%

Punctuation
Value | Count | Frequency (%)
(character not shown) | 176 | 100.0%

Issue
Categorical

Distinct: 166
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.8 MiB

Value | Count
Loan modification,collection,foreclosure | 112314
Incorrect information on credit report | 102687
Loan servicing, payments, escrow account | 77337
Incorrect information on your report | 60891
Cont'd attempts collect debt not owed | 60703
Other values (161) | 611078

Length

Max length: 80
Median length: 60
Mean length: 34.483481
Min length: 4

Characters and Unicode

Total characters: 35345913
Distinct characters: 49
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: Loan modification,collection,foreclosure
2nd row: Incorrect information on credit report
3rd row: Managing the loan or lease
4th row: Bankruptcy
5th row: Communication tactics

Common Values

Value | Count | Frequency (%)
Loan modification,collection,foreclosure | 112314 | 11.0%
Incorrect information on credit report | 102687 | 10.0%
Loan servicing, payments, escrow account | 77337 | 7.5%
Incorrect information on your report | 60891 | 5.9%
Cont'd attempts collect debt not owed | 60703 | 5.9%
Account opening, closing, or management | 37961 | 3.7%
Disclosure verification of debt | 30804 | 3.0%
Communication tactics | 29772 | 2.9%
Problem with a credit reporting company's investigation into an existing problem | 24400 | 2.4%
Deposits and withdrawals | 22851 | 2.2%
Other values (156) | 465290 | 45.4%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
loan | 234841 | 5.1%
report | 192758 | 4.2%
credit | 192469 | 4.2%
or | 172008 | 3.7%
on | 170723 | 3.7%
information | 164921 | 3.6%
incorrect | 163618 | 3.5%
account | 146976 | 3.2%
debt | 128708 | 2.8%
modification,collection,foreclosure | 112314 | 2.4%
Other values (234) | 2950938 | 63.7%

Most occurring characters

Value | Count | Frequency (%)
o | 3662301 | 10.4%
(space) | 3605264 | 10.2%
e | 3015794 | 8.5%
t | 3006506 | 8.5%
n | 2910718 | 8.2%
r | 2667822 | 7.5%
i | 2418176 | 6.8%
c | 2045811 | 5.8%
a | 1814316 | 5.1%
s | 1234995 | 3.5%
Other values (39) | 8964210 | 25.4%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 29952111 | 84.7%
Space Separator | 3605264 | 10.2%
Uppercase Letter | 1094006 | 3.1%
Other Punctuation | 694415 | 2.0%
Dash Punctuation | 117 | < 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
o | 3662301 | 12.2%
e | 3015794 | 10.1%
t | 3006506 | 10.0%
n | 2910718 | 9.7%
r | 2667822 | 8.9%
i | 2418176 | 8.1%
c | 2045811 | 6.8%
a | 1814316 | 6.1%
s | 1234995 | 4.1%
l | 1059530 | 3.5%
Other values (15) | 6116142 | 20.4%

Uppercase Letter
Value | Count | Frequency (%)
I | 212528 | 19.4%
L | 193639 | 17.7%
C | 172721 | 15.8%
A | 96692 | 8.8%
D | 83945 | 7.7%
P | 71272 | 6.5%
M | 47425 | 4.3%
T | 41029 | 3.8%
F | 32951 | 3.0%
U | 29227 | 2.7%
Other values (8) | 112577 | 10.3%

Other Punctuation
Value | Count | Frequency (%)
, | 508033 | 73.2%
' | 117686 | 16.9%
/ | 67371 | 9.7%
. | 1325 | 0.2%

Space Separator
Value | Count | Frequency (%)
(space) | 3605264 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 117 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 31046117 | 87.8%
Common | 4299796 | 12.2%

Most frequent character per script

Latin
Value | Count | Frequency (%)
o | 3662301 | 11.8%
e | 3015794 | 9.7%
t | 3006506 | 9.7%
n | 2910718 | 9.4%
r | 2667822 | 8.6%
i | 2418176 | 7.8%
c | 2045811 | 6.6%
a | 1814316 | 5.8%
s | 1234995 | 4.0%
l | 1059530 | 3.4%
Other values (33) | 7210148 | 23.2%

Common
Value | Count | Frequency (%)
(space) | 3605264 | 83.8%
, | 508033 | 11.8%
' | 117686 | 2.7%
/ | 67371 | 1.6%
. | 1325 | < 0.1%
- | 117 | < 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 35345913 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
o | 3662301 | 10.4%
(space) | 3605264 | 10.2%
e | 3015794 | 8.5%
t | 3006506 | 8.5%
n | 2910718 | 8.2%
r | 2667822 | 7.5%
i | 2418176 | 6.8%
c | 2045811 | 5.8%
a | 1814316 | 5.1%
s | 1234995 | 3.5%
Other values (39) | 8964210 | 25.4%

Sub-issue
Categorical

HIGH CARDINALITY  MISSING 

Distinct: 218
Distinct (%): < 0.1%
Missing: 496157
Missing (%): 48.4%
Memory size: 7.8 MiB

Value | Count
Account status | 37057
Debt is not mine | 36741
Information is not mine | 32385
Not given enough info to verify debt | 21818
Information belongs to someone else | 21809
Other values (213) | 379043

Length

Max length: 85
Median length: 68
Mean length: 30.911734
Min length: 11

Characters and Unicode

Total characters: 16347763
Distinct characters: 56
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: Account status
2nd row: Frequent or repeated calls
3rd row: Contacted employer after asked not to
4th row: Inadequate help over the phone
5th row: Not given enough info to verify debt

Common Values

Value | Count | Frequency (%)
Account status | 37057 | 3.6%
Debt is not mine | 36741 | 3.6%
Information is not mine | 32385 | 3.2%
Not given enough info to verify debt | 21818 | 2.1%
Information belongs to someone else | 21809 | 2.1%
Debt was paid | 21537 | 2.1%
Frequent or repeated calls | 17677 | 1.7%
Their investigation did not fix an error on your report | 17195 | 1.7%
Account status incorrect | 13690 | 1.3%
Attempted to collect wrong amount | 12619 | 1.2%
Other values (208) | 296325 | 28.9%
(Missing) | 496157 | 48.4%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
not | 144610 | 5.5%
debt | 123290 | 4.6%
information | 106117 | 4.0%
to | 105217 | 4.0%
is | 84635 | 3.2%
account | 81156 | 3.1%
mine | 69126 | 2.6%
report | 58382 | 2.2%
your | 58003 | 2.2%
status | 52571 | 2.0%
Other values (408) | 1769292 | 66.7%

Most occurring characters

Value | Count | Frequency (%)
(space) | 2123546 | 13.0%
e | 1587294 | 9.7%
t | 1550817 | 9.5%
o | 1453768 | 8.9%
n | 1199320 | 7.3%
i | 1042378 | 6.4%
r | 1035813 | 6.3%
a | 762615 | 4.7%
s | 677960 | 4.1%
c | 495737 | 3.0%
Other values (46) | 4418515 | 27.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 13589861 | 83.1%
Space Separator | 2123546 | 13.0%
Uppercase Letter | 554809 | 3.4%
Other Punctuation | 71272 | 0.4%
Decimal Number | 6026 | < 0.1%
Dash Punctuation | 2249 | < 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 1587294 | 11.7%
t | 1550817 | 11.4%
o | 1453768 | 10.7%
n | 1199320 | 8.8%
i | 1042378 | 7.7%
r | 1035813 | 7.6%
a | 762615 | 5.6%
s | 677960 | 5.0%
c | 495737 | 3.6%
d | 488925 | 3.6%
Other values (16) | 3295234 | 24.2%

Uppercase Letter
Value | Count | Frequency (%)
D | 106139 | 19.1%
A | 92408 | 16.7%
I | 68410 | 12.3%
P | 58183 | 10.5%
C | 56827 | 10.2%
T | 47671 | 8.6%
R | 39615 | 7.1%
N | 32345 | 5.8%
F | 20796 | 3.7%
O | 8032 | 1.4%
Other values (11) | 24383 | 4.4%

Decimal Number
Value | Count | Frequency (%)
3 | 1880 | 31.2%
0 | 1880 | 31.2%
8 | 1133 | 18.8%
9 | 1133 | 18.8%

Other Punctuation
Value | Count | Frequency (%)
' | 48858 | 68.6%
/ | 17826 | 25.0%
, | 4588 | 6.4%

Space Separator
Value | Count | Frequency (%)
(space) | 2123546 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2249 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 14144670 | 86.5%
Common | 2203093 | 13.5%

Most frequent character per script

Latin
Value | Count | Frequency (%)
e | 1587294 | 11.2%
t | 1550817 | 11.0%
o | 1453768 | 10.3%
n | 1199320 | 8.5%
i | 1042378 | 7.4%
r | 1035813 | 7.3%
a | 762615 | 5.4%
s | 677960 | 4.8%
c | 495737 | 3.5%
d | 488925 | 3.5%
Other values (37) | 3850043 | 27.2%

Common
Value | Count | Frequency (%)
(space) | 2123546 | 96.4%
' | 48858 | 2.2%
/ | 17826 | 0.8%
, | 4588 | 0.2%
- | 2249 | 0.1%
3 | 1880 | 0.1%
0 | 1880 | 0.1%
8 | 1133 | 0.1%
9 | 1133 | 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 16347763 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 2123546 | 13.0%
e | 1587294 | 9.7%
t | 1550817 | 9.5%
o | 1453768 | 8.9%
n | 1199320 | 7.3%
i | 1042378 | 6.4%
r | 1035813 | 6.3%
a | 762615 | 4.7%
s | 677960 | 4.1%
c | 495737 | 3.0%
Other values (46) | 4418515 | 27.0%

Consumer Complaint
Categorical

HIGH CARDINALITY  MISSING 

Distinct: 268391
Distinct (%): 96.6%
Missing: 747196
Missing (%): 72.9%
Memory size: 7.8 MiB

Value | Count
There are many mistakes appear in my report without my understanding. | 495
Equifax mishandled my information which has led to a breach that puts myself and millions of others at potential risk. I am extremely disappointed with how equifax has handled reporting this breach. Very little was done to notify the public for nearly a month after the breach was detected. I received no email, letter, or phone call and instead had to discover it via social media. | 118
I am filing this complaint because Experian has ignored my request to provide me with the documents that their company has on file that was used to verify the accounts I disputed. Being that they have gone past the 30 day mark and can not verify these accounts, under Section 611 ( 5 ) ( A ) of the FCRA - they are required to " ... promptly delete all information which can not be verified '' that I have disputed. Please resolve this manner as soon as possible. Thank you. | 103
I am filing this complaint because TransUnion has ignored my request to provide me with the documents that their company has on file that was used to verify the accounts I disputed. Being that they have gone past the 30 day mark and can not verify these accounts, under Section 611 ( 5 ) ( A ) of the FCRA - they are required to " ... promptly delete all information which can not be verified '' that I have disputed. Please resolve this manner as soon as possible. Thank you. | 50
This company continues to report on my credit report after I sent them a letter telling them that this account was not mine and I have no idea what it is or who it belongs to! I asked for proof of a signed contract, I asked for a license to collect in my state, I asked for copies of all information referenced for this debt and still to date, I have not received anything but harassment from this company! THIS IS NOT MY DEBT! I WANT THIS ACCOUNT REMOVED FROM MY CREDIT REPORT AND THIS COMPANY TO STOP CONTACTING ME IMMEDIATELY! | 49
Other values (268386) | 276999

Length

Max length: 31423
Median length: 7540
Mean length: 1071.1507
Min length: 5

Characters and Unicode

Total characters: 297580670
Distinct characters: 112
Distinct categories: 16
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 262683
Unique (%): 94.6%
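
"Distinct" and "Unique" measure different things: distinct is the number of different values, while unique is the number of values that occur exactly once. The sketch below illustrates the difference on a hypothetical toy column and then recomputes the two percentages for Consumer Complaint. It assumes the percentages are taken relative to the non-missing observations; the report does not state this, but the assumption reproduces 96.6% and 94.6%. The `distinct_and_unique` helper is hypothetical.

```python
from collections import Counter

N_OBSERVATIONS = 1025010
N_MISSING = 747196
N_DISTINCT = 268391
N_UNIQUE = 262683


def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing (None) values."""
    freq = Counter(v for v in values if v is not None)
    distinct = len(freq)                              # different values
    unique = sum(1 for n in freq.values() if n == 1)  # values seen exactly once
    return distinct, unique


# Hypothetical toy column: "b" repeats, so it is distinct but not unique.
toy = ["a", "b", "b", "c", None, None]
toy_distinct, toy_unique = distinct_and_unique(toy)  # (3, 2)

# Assumption: percentages use the non-missing count as the denominator.
n_present = N_OBSERVATIONS - N_MISSING
distinct_pct = N_DISTINCT / n_present
unique_pct = N_UNIQUE / n_present
print(f"Distinct (%): {distinct_pct:.1%}, Unique (%): {unique_pct:.1%}")
```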

Sample

1st row: I have outdated information on my credit report that I have previously disputed that has yet to be removed this information is more then seven years old and does not meet credit reporting requirements
2nd row: I purchased a new car on XXXX XXXX. The car dealer called Citizens Bank to get a 10 day payoff on my loan, good till XXXX XXXX. The dealer sent the check the next day. When I balanced my checkbook on XXXX XXXX. I noticed that Citizens bank had taken the automatic payment out of my checking account at XXXX XXXX XXXX Bank. I called Citizens and they stated that they did not close the loan until XXXX XXXX. ( stating that they did not receive the check until XXXX. XXXX. ). I told them that I did not believe that the check took that long to arrive. XXXX told me a check was issued to me for the amount overpaid, they deducted additional interest. Today ( XXXX XXXX, ) I called Citizens Bank again and talked to a supervisor named XXXX, because on XXXX XXXX. I received a letter that the loan had been paid in full ( dated XXXX, XXXX ) but no refund check was included. XXXX stated that they hold any over payment for 10 business days after the loan was satisfied and that my check would be mailed out on Wed. the XX/XX/XXXX.. I questioned her about the delay in posting the dealer payment and she first stated that sometimes it takes 3 or 4 business days to post, then she said they did not receive the check till XXXX XXXX I again told her that I did not believe this and asked where is my money. She then stated that they hold the over payment for 10 business days. I asked her why, and she simply said that is their policy. I asked her if I would receive interest on my money and she stated no. I believe that Citizens bank is deliberately delaying the posting of payment and the return of consumer 's money to make additional interest for the bank. If this is not illegal it should be, it does hurt the consumer and is not ethical. My amount of money lost is minimal but if they are doing this on thousands of car loans a month, then the additional interest earned for them could be staggering.
I still have another car loan from Citizens Bank and I am afraid when I trade that car in another year I will run into the same problem again.
3rd row: An account on my credit report has a mistaken date. I mailed in a debt validation letter to allow XXXX to correct the information. I received a letter in the mail, stating that Experian received my correspondence and found it to be " suspicious '' and that " I did n't write it ''. Experian 's letter is worded to imply that I am incapable of writing my own letter. I was deeply offended by this implication. I called Experian to figure out why my letter was so suspicious. I spoke to a representative who was incredibly unhelpful, She did not effectively answer any questions I asked of her, and she kept ignoring what I was saying regarding the offensive letter and my dispute process. I feel the representative did what she wanted to do, and I am not satisfied. It is STILL not clear to me why I received this letter. I typed this letter, I signed this letter, and I paid to mail this letter, yet Experian willfully disregarded my lawful request. I am disgusted with this entire situation, and I would like for my dispute to be handled appropriately, and I would like for an Experian representative to contact me and give me a real explanation for this letter.
4th row: This company refuses to provide me verification and validation of debt per my right under the FDCPA. I do not believe this debt is mine.
5th row: This complaint is in regards to Square Two Financial. Refer to CFPB case number XXXX regarding CACH, L. L. C. Square Two Financial has utilized my entire social security number to include date of birth on the pfd document listed with this complaint. The initial complaint was with CACH, L. L. C. and not Square Two Financial. This is in breach of the following : 1. Identity Theft Assumption and Deterrence Act of XXXX 2. Privacy Act of XXXX XXXX. Social Security XXXX 4. XXXX Privacy Act-which carries a maximum XXXX fine for each calendar cap year. 5. Breach of Title XXXX, XXXX XXXX XXXX XXXX under XXXX and XXXX The solution is to have CACH, L.L.C handle this correction and not Square Two Financial. Two Square Financial submitted the XXXX XXXX XXXX with their subscriber name on the form listed on CFPB case # XXXX they are rendered liable in this matter. In addition, there is an account number associated with this Universal Data Form and they could use that account number instead of a SSN and DOB which is against XXXX XXXX XXXX This is also includes removal of the XXXX XXXX Form off of CFPB case # XXXX listed as a pdf document attached to this case number. Square Two Financial was contacted at XXXXXXXXXXXX as of XXXX/XXXX/XXXX by e-mail in regards to this matter. In addition, all of my information is not for sale and distribution via fax, fax-scanned, copied, stored in a retrieval system, recorded, transmitted digitally or electronically without my expressed written consent. This information is protected under copyright and publishing laws of XXXX XXXX and XXXX XXXX. This information is protected under the XXXX XXXX XXXX XXXX XXXX under the freedom of speech under XXXX XXXX XXXX to include the Uniform Commercial Codes XXXX and XXXX. These rights are reserved world wide.

Common Values

Value | Count | Frequency (%)
There are many mistakes appear in my report without my understanding. | 495 | < 0.1%
Equifax mishandled my information which has led to a breach that puts myself and millions of others at potential risk. I am extremely disappointed with how equifax has handled reporting this breach. Very little was done to notify the public for nearly a month after the breach was detected. I received no email, letter, or phone call and instead had to discover it via social media. | 118 | < 0.1%
I am filing this complaint because Experian has ignored my request to provide me with the documents that their company has on file that was used to verify the accounts I disputed. Being that they have gone past the 30 day mark and can not verify these accounts, under Section 611 ( 5 ) ( A ) of the FCRA - they are required to " ... promptly delete all information which can not be verified '' that I have disputed. Please resolve this manner as soon as possible. Thank you. | 103 | < 0.1%
I am filing this complaint because TransUnion has ignored my request to provide me with the documents that their company has on file that was used to verify the accounts I disputed. Being that they have gone past the 30 day mark and can not verify these accounts, under Section 611 ( 5 ) ( A ) of the FCRA - they are required to " ... promptly delete all information which can not be verified '' that I have disputed. Please resolve this manner as soon as possible. Thank you. | 50 | < 0.1%
This company continues to report on my credit report after I sent them a letter telling them that this account was not mine and I have no idea what it is or who it belongs to! I asked for proof of a signed contract, I asked for a license to collect in my state, I asked for copies of all information referenced for this debt and still to date, I have not received anything but harassment from this company! THIS IS NOT MY DEBT! I WANT THIS ACCOUNT REMOVED FROM MY CREDIT REPORT AND THIS COMPANY TO STOP CONTACTING ME IMMEDIATELY! | 49 | < 0.1%
I am filing this complaint because Equifax has ignored my request to provide me with the documents that their company has on file that was used to verify the accounts I disputed. Being that they have gone past the 30 day mark and can not verify these accounts, under Section 611 ( 5 ) ( A ) of the FCRA - they are required to " ... promptly delete all information which can not be verified '' that I have disputed. Please resolve this manner as soon as possible. Thank you. | 45 | < 0.1%
Equifax mishandled my information which has led to a breach that puts myself and millions of others at potential risk. I am extremely disappointed with how equifax has handled reporting this breach. Very little was done to notify the public for nearly a month after the breach was detected. I received no email, letter, or phone call and instead had to discover it via social media. Going forward equifax should be required to monitor every account and notify victims if any fraud occurs. Credit fraud protection should be mandatory for every account, not an option for us to have deal with. This should be rectified firstly by making credit freezing free and refunding everyone who paid for it following this data breach. Requiring a police report is absurd when they clearly know if you were affected. | 36 | < 0.1%
Equifax mishandled my information which has led to a breach that puts myself and millions of others at potential risk. I am extremely disappointed with how Equifax has handled reporting this breach. Very little was done to notify the public for nearly a month after the breach was detected. I received no email, letter, or phone call and instead had to discover it via social media. | 34 | < 0.1%
This company continues to report on my credit report after I sent them a letter telling them that this account was not mine and I have no idea what it is or who it belongs to! I asked for proof of a signed contract, I asked for a license to collect in my state, I asked for copies of all information referenced for this debt and still to date, I have not received anything but harassment from this company! THIS IS NOT MY DEBT! | 28 | < 0.1%
I have been a victim of Identity Theft. I have been trying to work with the Credit Reporting Agency but they are refusing to honor my valid identity theft case thus these incorrect/fraudulent items are still on my credit report and they must be removed immediately but they are do not belong to me. I have provided all of the proof to show that I was a victim of Identity Theft and that to the best of my knowledge these fraudulent accounts do not belong to me. Please take immediate action on my behalf so I can have these items removed, deleted and permanently blocked from my credit report, so that I can get back on track to a normal life. Regards | 27 | < 0.1%
Other values (268381) | 276829 | 27.0%
(Missing) | 747196 | 72.9%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
xxxx | 2433756 | 4.5%
the | 2227719 | 4.1%
i | 1991983 | 3.7%
to | 1835505 | 3.4%
and | 1457528 | 2.7%
(token not shown) | 1108389 | 2.0%
my | 1094649 | 2.0%
a | 1074779 | 2.0%
of | 878837 | 1.6%
that | 850846 | 1.6%
Other values (156071) | 39332926 | 72.5%

Most occurring characters

Value | Count | Frequency (%)
(space) | 54644282 | 18.4%
e | 26580718 | 8.9%
t | 21026721 | 7.1%
a | 18084765 | 6.1%
o | 15991705 | 5.4%
n | 15559636 | 5.2%
i | 13683389 | 4.6%
X | 12663859 | 4.3%
r | 12099290 | 4.1%
s | 11362561 | 3.8%
Other values (102) | 95883744 | 32.2%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 209363857 | 70.4%
Space Separator | 54644385 | 18.4%
Uppercase Letter | 23081068 | 7.8%
Other Punctuation | 6168113 | 2.1%
Decimal Number | 2399429 | 0.8%
Close Punctuation | 489018 | 0.2%
Open Punctuation | 473867 | 0.2%
Control | 431214 | 0.1%
Currency Symbol | 271579 | 0.1%
Dash Punctuation | 208258 | 0.1%
Other values (6) | 49882 | < 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 26580718 | 12.7%
t | 21026721 | 10.0%
a | 18084765 | 8.6%
o | 15991705 | 7.6%
n | 15559636 | 7.4%
i | 13683389 | 6.5%
r | 12099290 | 5.8%
s | 11362561 | 5.4%
h | 9788258 | 4.7%
d | 9590564 | 4.6%
Other values (17) | 55596250 | 26.6%

Uppercase Letter
Value | Count | Frequency (%)
X | 12663859 | 54.9%
I | 2447411 | 10.6%
T | 1022894 | 4.4%
A | 801679 | 3.5%
E | 650304 | 2.8%
C | 600196 | 2.6%
S | 561734 | 2.4%
O | 495524 | 2.1%
N | 477136 | 2.1%
R | 387862 | 1.7%
Other values (16) | 2972469 | 12.9%

Other Punctuation
Value | Count | Frequency (%)
. | 2912862 | 47.2%
, | 1458788 | 23.7%
/ | 756218 | 12.3%
' | 524720 | 8.5%
" | 108647 | 1.8%
: | 96619 | 1.6%
! | 86647 | 1.4%
? | 60374 | 1.0%
; | 40249 | 0.7%
# | 38288 | 0.6%
Other values (8) | 84701 | 1.4%

Decimal Number
Value | Count | Frequency (%)
0 | 1144622 | 47.7%
1 | 322514 | 13.4%
2 | 254393 | 10.6%
5 | 144685 | 6.0%
3 | 137883 | 5.7%
6 | 116134 | 4.8%
4 | 86337 | 3.6%
7 | 81768 | 3.4%
8 | 55866 | 2.3%
9 | 55227 | 2.3%

Math Symbol
Value | Count | Frequency (%)
> | 11571 | 32.3%
< | 10620 | 29.6%
+ | 7860 | 21.9%
= | 3889 | 10.8%
~ | 1108 | 3.1%
'|' (vertical bar) | 805 | 2.2%

Control
Value | Count | Frequency (%)
(character not shown) | 427284 | 99.1%
(character not shown) | 3921 | 0.9%
(character not shown) | 7 | < 0.1%
(character not shown) | 2 | < 0.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 208234 | > 99.9%
(character not shown) | 15 | < 0.1%
(character not shown) | 8 | < 0.1%
(character not shown) | 1 | < 0.1%

Close Punctuation
Value | Count | Frequency (%)
} | 252372 | 51.6%
) | 232186 | 47.5%
] | 4460 | 0.9%

Open Punctuation
Value | Count | Frequency (%)
{ | 252306 | 53.2%
( | 217164 | 45.8%
[ | 4397 | 0.9%

Space Separator
Value | Count | Frequency (%)
(space) | 54644282 | > 99.9%
(character not shown) | 103 | < 0.1%

Modifier Symbol
Value | Count | Frequency (%)
` | 475 | 96.9%
^ | 15 | 3.1%

Final Punctuation
Value | Count | Frequency (%)
(character not shown) | 171 | 69.0%
(character not shown) | 77 | 31.0%

Initial Punctuation
Value | Count | Frequency (%)
(character not shown) | 77 | 98.7%
(character not shown) | 1 | 1.3%

Currency Symbol
Value | Count | Frequency (%)
$ | 271579 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 13212 | 100.0%

Other Number
Value | Count | Frequency (%)
½ | 1 | 100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 232444925
78.1%
Common 65135745
 
21.9%

Most frequent character per script

Common
ValueCountFrequency (%)
54644282
83.9%
. 2912862
 
4.5%
, 1458788
 
2.2%
0 1144622
 
1.8%
/ 756218
 
1.2%
' 524720
 
0.8%
427284
 
0.7%
1 322514
 
0.5%
$ 271579
 
0.4%
2 254393
 
0.4%
Other values (49) 2418483
 
3.7%
Latin
ValueCountFrequency (%)
e 26580718
 
11.4%
t 21026721
 
9.0%
a 18084765
 
7.8%
o 15991705
 
6.9%
n 15559636
 
6.7%
i 13683389
 
5.9%
X 12663859
 
5.4%
r 12099290
 
5.2%
s 11362561
 
4.9%
h 9788258
 
4.2%
Other values (43) 75604023
32.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 297580134
> 99.9%
Punctuation 427
 
< 0.1%
None 109
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
54644282
18.4%
e 26580718
 
8.9%
t 21026721
 
7.1%
a 18084765
 
6.1%
o 15991705
 
5.4%
n 15559636
 
5.2%
i 13683389
 
4.6%
X 12663859
 
4.3%
r 12099290
 
4.1%
s 11362561
 
3.8%
Other values (88) 95883208
32.2%
Punctuation
ValueCountFrequency (%)
171
40.0%
77
18.0%
77
18.0%
70
16.4%
15
 
3.5%
8
 
1.9%
7
 
1.6%
1
 
0.2%
1
 
0.2%
None
ValueCountFrequency (%)
  103
94.5%
§ 2
 
1.8%
 2
 
1.8%
é 1
 
0.9%
½ 1
 
0.9%

Company Public Response
Categorical

IMBALANCE  MISSING 

Distinct10
Distinct (%)< 0.1%
Missing706646
Missing (%)68.9%
Memory size7.8 MiB
Company has responded to the consumer and the CFPB and chooses not to provide a public response
198774 
Company chooses not to provide a public response
52473 
Company believes it acted appropriately as authorized by contract or law
48123 
Company believes the complaint is the result of a misunderstanding
 
4571
Company disputes the facts presented in the complaint
 
4180
Other values (5)
 
10243

Length

Max length119
Median length95
Mean length82.503383
Min length48

Characters and Unicode

Total characters26266107
Distinct characters28
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
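
The category tables in this report ("Lowercase Letter", "Space Separator", and so on) correspond to Unicode general categories. As a minimal sketch of how such counts can be reproduced, assuming only Python's standard `unicodedata` module, the hypothetical helper `category_counts` below tallies characters per category; the label mapping covers only a subset of categories and is an illustration, not the report generator's own code.

```python
import unicodedata
from collections import Counter

# Subset of Unicode general-category codes, labeled the way this report labels them.
# Codes not listed here are reported under their raw two-letter code.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Sc": "Currency Symbol",
    "Sm": "Math Symbol",
    "Cc": "Control",
}


def category_counts(values):
    """Count characters per Unicode general category across an iterable of strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# One of the category values listed for this variable.
sample = ["Company can't verify or dispute the facts in the complaint"]
print(category_counts(sample).most_common())
```

Applied to a whole column (with missing values dropped first), the same tally would be expected to yield the "Most occurring categories" breakdown.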

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCompany has responded to the consumer and the CFPB and chooses not to provide a public response
2nd rowCompany believes it acted appropriately as authorized by contract or law
3rd rowCompany chooses not to provide a public response
4th rowCompany believes it acted appropriately as authorized by contract or law
5th rowCompany believes it acted appropriately as authorized by contract or law

Common Values

Value | Count | Frequency (%)
Company has responded to the consumer and the CFPB and chooses not to provide a public response | 198774 | 19.4%
Company chooses not to provide a public response | 52473 | 5.1%
Company believes it acted appropriately as authorized by contract or law | 48123 | 4.7%
Company believes the complaint is the result of a misunderstanding | 4571 | 0.4%
Company disputes the facts presented in the complaint | 4180 | 0.4%
Company believes complaint caused principally by actions of third party outside the control or direction of the company | 3359 | 0.3%
Company believes complaint is the result of an isolated error | 3044 | 0.3%
Company can't verify or dispute the facts in the complaint | 1928 | 0.2%
Company believes complaint represents an opportunity for improvement to better serve consumers | 1859 | 0.2%
Company believes complaint relates to a discontinued policy or procedure | 53 | < 0.1%
(Missing) | 706646 | 68.9%
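
The report does not say how the IMBALANCE flag on this variable is computed. One common definition, assumed here, is one minus the Shannon entropy of the category frequencies normalized by the maximum entropy for that number of categories. The sketch below applies that assumed formula to the ten non-missing counts listed above; the function name `imbalance_score` is hypothetical. The result comes out close to the 50.1% given for this variable in the overview alerts, which is consistent with the assumption but does not prove it.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of the
    class frequencies and k is the number of classes.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    This formula is an assumption, not taken from the report itself.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)


# Non-missing counts for Company Public Response, copied from the table above.
public_response_counts = [198774, 52473, 48123, 4571, 4180, 3359, 3044, 1928, 1859, 53]
print(f"{imbalance_score(public_response_counts):.3f}")
```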

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
to 451933
 
10.0%
the 428668
 
9.4%
and 397548
 
8.8%
company 321723
 
7.1%
a 255871
 
5.6%
provide 251247
 
5.5%
response 251247
 
5.5%
public 251247
 
5.5%
not 251247
 
5.5%
chooses 251247
 
5.5%
Other values (48) 1428487
31.5%

Most occurring characters

ValueCountFrequency (%)
4222101
16.1%
o 2690935
10.2%
e 2425156
 
9.2%
s 1777912
 
6.8%
n 1738240
 
6.6%
a 1563814
 
6.0%
t 1504548
 
5.7%
p 1465508
 
5.6%
r 1205249
 
4.6%
d 1178658
 
4.5%
Other values (18) 6493986
24.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 20928618
79.7%
Space Separator 4222101
 
16.1%
Uppercase Letter 1113460
 
4.2%
Other Punctuation 1928
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 2690935
12.9%
e 2425156
11.6%
s 1777912
 
8.5%
n 1738240
 
8.3%
a 1563814
 
7.5%
t 1504548
 
7.2%
p 1465508
 
7.0%
r 1205249
 
5.8%
d 1178658
 
5.6%
h 930171
 
4.4%
Other values (12) 4448427
21.3%
Uppercase Letter
ValueCountFrequency (%)
C 517138
46.4%
F 198774
 
17.9%
P 198774
 
17.9%
B 198774
 
17.9%
Space Separator
ValueCountFrequency (%)
4222101
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1928
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 22042078
83.9%
Common 4224029
 
16.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 2690935
12.2%
e 2425156
11.0%
s 1777912
 
8.1%
n 1738240
 
7.9%
a 1563814
 
7.1%
t 1504548
 
6.8%
p 1465508
 
6.6%
r 1205249
 
5.5%
d 1178658
 
5.3%
h 930171
 
4.2%
Other values (16) 5561887
25.2%
Common
ValueCountFrequency (%)
4222101
> 99.9%
' 1928
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26266107
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4222101
16.1%
o 2690935
10.2%
e 2425156
 
9.2%
s 1777912
 
6.8%
n 1738240
 
6.6%
a 1563814
 
6.0%
t 1504548
 
5.7%
p 1465508
 
5.6%
r 1205249
 
4.6%
d 1178658
 
4.5%
Other values (18) 6493986
24.7%

Company
Categorical

Distinct4780
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size7.8 MiB
EQUIFAX, INC.
83659 
BANK OF AMERICA, NATIONAL ASSOCIATION
74423 
Experian Information Solutions Inc.
72858 
TRANSUNION INTERMEDIATE HOLDINGS, INC.
 
66269
WELLS FARGO & COMPANY
 
62345
Other values (4775)
665456 

Length

Max length88
Median length57
Mean length25.077836
Min length3

Characters and Unicode

Total characters25705033
Distinct characters77
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1031 ?
Unique (%)0.1%

Sample

1st rowM&T BANK CORPORATION
2nd rowTRANSUNION INTERMEDIATE HOLDINGS, INC.
3rd rowCITIZENS FINANCIAL GROUP, INC.
4th rowAMERICAN EXPRESS COMPANY
5th rowCITIBANK, N.A.

Common Values

Value | Count | Frequency (%)
EQUIFAX, INC. | 83659 | 8.2%
BANK OF AMERICA, NATIONAL ASSOCIATION | 74423 | 7.3%
Experian Information Solutions Inc. | 72858 | 7.1%
TRANSUNION INTERMEDIATE HOLDINGS, INC. | 66269 | 6.5%
WELLS FARGO & COMPANY | 62345 | 6.1%
JPMORGAN CHASE & CO. | 51338 | 5.0%
CITIBANK, N.A. | 41763 | 4.1%
CAPITAL ONE FINANCIAL CORPORATION | 26983 | 2.6%
OCWEN LOAN SERVICING LLC | 26308 | 2.6%
Navient Solutions, LLC. | 24376 | 2.4%
Other values (4770) | 494688 | 48.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
inc 392210
 
11.0%
bank 131787
 
3.7%
128482
 
3.6%
llc 128200
 
3.6%
solutions 107411
 
3.0%
financial 101043
 
2.8%
national 95469
 
2.7%
holdings 91191
 
2.6%
company 91003
 
2.6%
association 88801
 
2.5%
Other values (4017) 2200936
61.9%

Most occurring characters

ValueCountFrequency (%)
2531353
 
9.8%
A 1891681
 
7.4%
N 1886440
 
7.3%
I 1800601
 
7.0%
C 1328945
 
5.2%
O 1316306
 
5.1%
E 1019032
 
4.0%
S 985741
 
3.8%
R 856843
 
3.3%
L 852789
 
3.3%
Other values (67) 11235302
43.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 16170299
62.9%
Lowercase Letter 5736963
 
22.3%
Space Separator 2531563
 
9.8%
Other Punctuation 1252448
 
4.9%
Open Punctuation 4301
 
< 0.1%
Close Punctuation 4301
 
< 0.1%
Dash Punctuation 2943
 
< 0.1%
Decimal Number 2073
 
< 0.1%
Final Punctuation 140
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 796911
13.9%
i 604823
10.5%
o 587642
10.2%
e 539319
9.4%
a 469035
8.2%
t 441127
7.7%
r 399434
7.0%
c 335548
 
5.8%
s 304917
 
5.3%
l 271200
 
4.7%
Other values (17) 987007
17.2%
Uppercase Letter
ValueCountFrequency (%)
A 1891681
11.7%
N 1886440
11.7%
I 1800601
11.1%
C 1328945
 
8.2%
O 1316306
 
8.1%
E 1019032
 
6.3%
S 985741
 
6.1%
R 856843
 
5.3%
L 852789
 
5.3%
T 797806
 
4.9%
Other values (16) 3434115
21.2%
Decimal Number
ValueCountFrequency (%)
2 702
33.9%
1 695
33.5%
8 162
 
7.8%
3 149
 
7.2%
4 111
 
5.4%
0 101
 
4.9%
6 95
 
4.6%
9 56
 
2.7%
5 2
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 624217
49.8%
, 484024
38.6%
& 136032
 
10.9%
/ 6500
 
0.5%
' 1378
 
0.1%
* 191
 
< 0.1%
" 106
 
< 0.1%
Space Separator
ValueCountFrequency (%)
2531353
> 99.9%
  210
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 2895
98.4%
48
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 4301
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4301
100.0%
Final Punctuation
ValueCountFrequency (%)
140
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 21907262
85.2%
Common 3797771
 
14.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 1891681
 
8.6%
N 1886440
 
8.6%
I 1800601
 
8.2%
C 1328945
 
6.1%
O 1316306
 
6.0%
E 1019032
 
4.7%
S 985741
 
4.5%
R 856843
 
3.9%
L 852789
 
3.9%
T 797806
 
3.6%
Other values (43) 9171078
41.9%
Common
ValueCountFrequency (%)
2531353
66.7%
. 624217
 
16.4%
, 484024
 
12.7%
& 136032
 
3.6%
/ 6500
 
0.2%
( 4301
 
0.1%
) 4301
 
0.1%
- 2895
 
0.1%
' 1378
 
< 0.1%
2 702
 
< 0.1%
Other values (14) 2068
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25704378
> 99.9%
None 467
 
< 0.1%
Punctuation 188
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2531353
 
9.8%
A 1891681
 
7.4%
N 1886440
 
7.3%
I 1800601
 
7.0%
C 1328945
 
5.2%
O 1316306
 
5.1%
E 1019032
 
4.0%
S 985741
 
3.8%
R 856843
 
3.3%
L 852789
 
3.3%
Other values (63) 11234647
43.7%
None
ValueCountFrequency (%)
à 257
55.0%
  210
45.0%
Punctuation
ValueCountFrequency (%)
140
74.5%
48
 
25.5%

State
Categorical

HIGH CARDINALITY  MISSING 

Distinct63
Distinct (%)< 0.1%
Missing12360
Missing (%)1.2%
Memory size7.8 MiB
CA
143662 
FL
98007 
TX
83248 
NY
68892 
GA
 
51672
Other values (58)
567169 

Length

Max length36
Median length2
Mean length2.0002015
Min length2

Characters and Unicode

Total characters2025504
Distinct characters25
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMI
2nd rowAL
3rd rowPA
4th rowID
5th rowVA

Common Values

Value | Count | Frequency (%)
CA | 143662 | 14.0%
FL | 98007 | 9.6%
TX | 83248 | 8.1%
NY | 68892 | 6.7%
GA | 51672 | 5.0%
NJ | 39349 | 3.8%
IL | 39129 | 3.8%
PA | 35499 | 3.5%
VA | 31187 | 3.0%
OH | 30945 | 3.0%
Other values (53) | 391060 | 38.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
ca 143662
 
14.2%
fl 98007
 
9.7%
tx 83248
 
8.2%
ny 68892
 
6.8%
ga 51672
 
5.1%
nj 39349
 
3.9%
il 39129
 
3.9%
pa 35499
 
3.5%
va 31187
 
3.1%
oh 30945
 
3.1%
Other values (57) 391084
38.6%

Most occurring characters

ValueCountFrequency (%)
A 357714
17.7%
C 223321
11.0%
N 203990
10.1%
L 160218
 
7.9%
T 120577
 
6.0%
M 114714
 
5.7%
I 100484
 
5.0%
F 98061
 
4.8%
X 83248
 
4.1%
O 79728
 
3.9%
Other values (15) 483449
23.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 2025480
> 99.9%
Space Separator 24
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 357714
17.7%
C 223321
11.0%
N 203990
10.1%
L 160218
 
7.9%
T 120577
 
6.0%
M 114714
 
5.7%
I 100484
 
5.0%
F 98061
 
4.8%
X 83248
 
4.1%
O 79728
 
3.9%
Other values (14) 483425
23.9%
Space Separator
ValueCountFrequency (%)
24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2025480
> 99.9%
Common 24
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 357714
17.7%
C 223321
11.0%
N 203990
10.1%
L 160218
 
7.9%
T 120577
 
6.0%
M 114714
 
5.7%
I 100484
 
5.0%
F 98061
 
4.8%
X 83248
 
4.1%
O 79728
 
3.9%
Other values (14) 483425
23.9%
Common
ValueCountFrequency (%)
24
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2025504
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 357714
17.7%
C 223321
11.0%
N 203990
10.1%
L 160218
 
7.9%
T 120577
 
6.0%
M 114714
 
5.7%
I 100484
 
5.0%
F 98061
 
4.8%
X 83248
 
4.1%
O 79728
 
3.9%
Other values (15) 483449
23.9%

ZIP code
Categorical

HIGH CARDINALITY  MISSING 

Distinct28944
Distinct (%)2.9%
Missing16718
Missing (%)1.6%
Memory size7.8 MiB
300XX
 
5475
770XX
 
4438
750XX
 
3691
606XX
 
3486
331XX
 
3322
Other values (28939)
987880 

Length

Max length6
Median length5
Mean length4.936206
Min length1

Characters and Unicode

Total characters4977137
Distinct characters26
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6286 ?
Unique (%)0.6%

Sample

1st row48382
2nd row352XX
3rd row177XX
4th row83854
5th row23233

Common Values

Value | Count | Frequency (%)
300XX | 5475 | 0.5%
770XX | 4438 | 0.4%
750XX | 3691 | 0.4%
606XX | 3486 | 0.3%
331XX | 3322 | 0.3%
330XX | 3289 | 0.3%
900XX | 3048 | 0.3%
303XX | 2888 | 0.3%
945XX | 2856 | 0.3%
334XX | 2797 | 0.3%
Other values (28934) | 973002 | 94.9%
(Missing) | 16718 | 1.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
300xx 5475
 
0.5%
770xx 4438
 
0.4%
750xx 3691
 
0.4%
606xx 3486
 
0.3%
331xx 3322
 
0.3%
330xx 3289
 
0.3%
900xx 3048
 
0.3%
303xx 2888
 
0.3%
945xx 2856
 
0.3%
334xx 2797
 
0.3%
Other values (28922) 973003
96.5%

Most occurring characters

ValueCountFrequency (%)
0 627856
12.6%
X 588472
11.8%
3 547204
11.0%
1 541416
10.9%
2 518223
10.4%
7 400327
8.0%
4 387040
7.8%
9 368541
7.4%
5 342471
6.9%
8 329250
6.6%
Other values (16) 326337
6.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4388570
88.2%
Uppercase Letter 588473
 
11.8%
Dash Punctuation 56
 
< 0.1%
Open Punctuation 16
 
< 0.1%
Other Punctuation 9
 
< 0.1%
Modifier Symbol 4
 
< 0.1%
Math Symbol 3
 
< 0.1%
Currency Symbol 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 627856
14.3%
3 547204
12.5%
1 541416
12.3%
2 518223
11.8%
7 400327
9.1%
4 387040
8.8%
9 368541
8.4%
5 342471
7.8%
8 329250
7.5%
6 326242
7.4%
Other Punctuation
ValueCountFrequency (%)
* 4
44.4%
. 2
22.2%
/ 1
 
11.1%
" 1
 
11.1%
! 1
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
X 588472
> 99.9%
M 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 13
81.2%
( 3
 
18.8%
Lowercase Letter
ValueCountFrequency (%)
a 1
50.0%
r 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 3
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4388662
88.2%
Latin 588475
 
11.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 627856
14.3%
3 547204
12.5%
1 541416
12.3%
2 518223
11.8%
7 400327
9.1%
4 387040
8.8%
9 368541
8.4%
5 342471
7.8%
8 329250
7.5%
6 326242
7.4%
Other values (12) 92
 
< 0.1%
Latin
ValueCountFrequency (%)
X 588472
> 99.9%
M 1
 
< 0.1%
a 1
 
< 0.1%
r 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4977137
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 627856
12.6%
X 588472
11.8%
3 547204
11.0%
1 541416
10.9%
2 518223
10.4%
7 400327
8.0%
4 387040
7.8%
9 368541
7.4%
5 342471
6.9%
8 329250
6.6%
Other values (16) 326337
6.6%

Tags
Categorical

Distinct3
Distinct (%)< 0.1%
Missing883422
Missing (%)86.2%
Memory size7.8 MiB
Older American
68727 
Servicemember
61656 
Older American, Servicemember
11205 

Length

Max length29
Median length14
Mean length14.75161
Min length13

Characters and Unicode

Total characters2088651
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOlder American
2nd rowOlder American
3rd rowOlder American
4th rowServicemember
5th rowServicemember

Common Values

Value | Count | Frequency (%)
Older American | 68727 | 6.7%
Servicemember | 61656 | 6.0%
Older American, Servicemember | 11205 | 1.1%
(Missing) | 883422 | 86.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
older 79932
34.3%
american 79932
34.3%
servicemember 72861
31.3%

Most occurring characters

ValueCountFrequency (%)
e 451308
21.6%
r 305586
14.6%
m 225654
10.8%
i 152793
 
7.3%
c 152793
 
7.3%
91137
 
4.4%
O 79932
 
3.8%
l 79932
 
3.8%
d 79932
 
3.8%
A 79932
 
3.8%
Other values (6) 389652
18.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1753584
84.0%
Uppercase Letter 232725
 
11.1%
Space Separator 91137
 
4.4%
Other Punctuation 11205
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 451308
25.7%
r 305586
17.4%
m 225654
12.9%
i 152793
 
8.7%
c 152793
 
8.7%
l 79932
 
4.6%
d 79932
 
4.6%
a 79932
 
4.6%
n 79932
 
4.6%
v 72861
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
O 79932
34.3%
A 79932
34.3%
S 72861
31.3%
Space Separator
ValueCountFrequency (%)
91137
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11205
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1986309
95.1%
Common 102342
 
4.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 451308
22.7%
r 305586
15.4%
m 225654
11.4%
i 152793
 
7.7%
c 152793
 
7.7%
O 79932
 
4.0%
l 79932
 
4.0%
d 79932
 
4.0%
A 79932
 
4.0%
a 79932
 
4.0%
Other values (4) 298515
15.0%
Common
ValueCountFrequency (%)
91137
89.1%
, 11205
 
10.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2088651
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 451308
21.6%
r 305586
14.6%
m 225654
10.8%
i 152793
 
7.3%
c 152793
 
7.3%
91137
 
4.4%
O 79932
 
3.8%
l 79932
 
3.8%
d 79932
 
3.8%
A 79932
 
3.8%
Other values (6) 389652
18.7%

Consumer consent provided?
Categorical

HIGH CORRELATION  MISSING 

Distinct4
Distinct (%)< 0.1%
Missing533099
Missing (%)52.0%
Memory size7.8 MiB
Consent provided
277814 
Consent not provided
200503 
Other
 
12605
Consent withdrawn
 
989

Length

Max length20
Median length16
Mean length17.350541
Min length5

Characters and Unicode

Total characters8534922
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowConsent provided
2nd rowConsent provided
3rd rowConsent not provided
4th rowConsent provided
5th rowConsent provided

Common Values

Value | Count | Frequency (%)
Consent provided | 277814 | 27.1%
Consent not provided | 200503 | 19.6%
Other | 12605 | 1.2%
Consent withdrawn | 989 | 0.1%
(Missing) | 533099 | 52.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
consent 479306
40.9%
provided 478317
40.8%
not 200503
17.1%
other 12605
 
1.1%
withdrawn 989
 
0.1%

Most occurring characters

ValueCountFrequency (%)
n 1160104
13.6%
o 1158126
13.6%
e 970228
11.4%
d 957623
11.2%
t 693403
8.1%
679809
8.0%
r 491911
5.8%
C 479306
5.6%
s 479306
5.6%
i 479306
5.6%
Other values (6) 985800
11.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 7363202
86.3%
Space Separator 679809
 
8.0%
Uppercase Letter 491911
 
5.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 1160104
15.8%
o 1158126
15.7%
e 970228
13.2%
d 957623
13.0%
t 693403
9.4%
r 491911
6.7%
s 479306
6.5%
i 479306
6.5%
p 478317
6.5%
v 478317
6.5%
Other values (3) 16561
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
C 479306
97.4%
O 12605
 
2.6%
Space Separator
ValueCountFrequency (%)
679809
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7855113
92.0%
Common 679809
 
8.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 1160104
14.8%
o 1158126
14.7%
e 970228
12.4%
d 957623
12.2%
t 693403
8.8%
r 491911
6.3%
C 479306
6.1%
s 479306
6.1%
i 479306
6.1%
p 478317
6.1%
Other values (5) 507483
6.5%
Common
ValueCountFrequency (%)
679809
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8534922
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 1160104
13.6%
o 1158126
13.6%
e 970228
11.4%
d 957623
11.2%
t 693403
8.1%
679809
8.0%
r 491911
5.8%
C 479306
5.6%
s 479306
5.6%
i 479306
5.6%
Other values (6) 985800
11.6%

Submitted via
Categorical

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size7.8 MiB
Web
734874 
Referral
151535 
Phone
 
63506
Postal mail
 
59430
Fax
 
15300

Length

Max length11
Median length3
Mean length4.3276524
Min length3

Characters and Unicode

Total characters4435887
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowReferral
2nd rowWeb
3rd rowWeb
4th rowWeb
5th rowWeb

Common Values

Value | Count | Frequency (%)
Web | 734874 | 71.7%
Referral | 151535 | 14.8%
Phone | 63506 | 6.2%
Postal mail | 59430 | 5.8%
Fax | 15300 | 1.5%
Email | 365 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
web 734874
67.8%
referral 151535
 
14.0%
phone 63506
 
5.9%
postal 59430
 
5.5%
mail 59430
 
5.5%
fax 15300
 
1.4%
email 365
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e 1101450
24.8%
W 734874
16.6%
b 734874
16.6%
r 303070
 
6.8%
a 286060
 
6.4%
l 270760
 
6.1%
R 151535
 
3.4%
f 151535
 
3.4%
o 122936
 
2.8%
P 122936
 
2.8%
Other values (10) 455857
10.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3351447
75.6%
Uppercase Letter 1025010
 
23.1%
Space Separator 59430
 
1.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1101450
32.9%
b 734874
21.9%
r 303070
 
9.0%
a 286060
 
8.5%
l 270760
 
8.1%
f 151535
 
4.5%
o 122936
 
3.7%
h 63506
 
1.9%
n 63506
 
1.9%
m 59795
 
1.8%
Other values (4) 193955
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
W 734874
71.7%
R 151535
 
14.8%
P 122936
 
12.0%
F 15300
 
1.5%
E 365
 
< 0.1%
Space Separator
ValueCountFrequency (%)
59430
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4376457
98.7%
Common 59430
 
1.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1101450
25.2%
W 734874
16.8%
b 734874
16.8%
r 303070
 
6.9%
a 286060
 
6.5%
l 270760
 
6.2%
R 151535
 
3.5%
f 151535
 
3.5%
o 122936
 
2.8%
P 122936
 
2.8%
Other values (9) 396427
 
9.1%
Common
ValueCountFrequency (%)
59430
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4435887
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 1101450
24.8%
W 734874
16.6%
b 734874
16.6%
r 303070
 
6.8%
a 286060
 
6.4%
l 270760
 
6.1%
R 151535
 
3.4%
f 151535
 
3.4%
o 122936
 
2.8%
P 122936
 
2.8%
Other values (10) 455857
10.3%

Date Sent to Company
Categorical

HIGH CARDINALITY

Distinct2292
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.8 MiB
09-08-2017
 
3387
09-09-2017
 
2656
01/19/2017
 
1613
09/13/2017
 
1535
01/20/2017
 
1467
Other values (2287)
1014352 

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters10250100
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)< 0.1%

Sample

1st row03/17/2014
2nd row10-05-2016
3rd row10/20/2016
4th row06-10-2014
5th row09/13/2014
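
The sample rows above mix two layouts: slash-separated values and dash-separated values. The slash values include entries such as 03/17/2014 whose second field exceeds 12, so they can only be month/day/year. The dash values shown (for example 10-05-2016 and 06-10-2014) have both leading fields at or below 12, so their field order cannot be determined from this report. The sketch below, using only the standard library, parses both layouts; the helper name `parse_sent_date` and the default month-first reading of dash values are assumptions that should be checked against the source data before use.

```python
from datetime import date, datetime


def parse_sent_date(raw, dash_format="%m-%d-%Y"):
    """Parse a date string that uses either '/' or '-' as separator.

    Slash values are read as MM/DD/YYYY, which the report's samples support.
    Dash values are ambiguous in the report; `dash_format` defaults to
    month-first as an ASSUMPTION and can be switched to "%d-%m-%Y".
    """
    raw = raw.strip()
    if "/" in raw:
        return datetime.strptime(raw, "%m/%d/%Y").date()
    return datetime.strptime(raw, dash_format).date()


# Values copied from the sample rows above.
for value in ["03/17/2014", "10-05-2016", "10/20/2016", "06-10-2014", "09/13/2014"]:
    print(value, "->", parse_sent_date(value).isoformat())
```

Keeping the dash interpretation as an explicit parameter makes the ambiguity visible to whoever cleans the column, rather than hiding it inside a permissive parser.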

Common Values

Value | Count | Frequency (%)
09-08-2017 | 3387 | 0.3%
09-09-2017 | 2656 | 0.3%
01/19/2017 | 1613 | 0.2%
09/13/2017 | 1535 | 0.1%
01/20/2017 | 1467 | 0.1%
09/14/2017 | 1264 | 0.1%
01/24/2017 | 1235 | 0.1%
04-10-2018 | 1205 | 0.1%
01/25/2017 | 1148 | 0.1%
04-11-2018 | 1143 | 0.1%
Other values (2282) | 1008357 | 98.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
09-08-2017 3387
 
0.3%
09-09-2017 2656
 
0.3%
01/19/2017 1613
 
0.2%
09/13/2017 1535
 
0.1%
01/20/2017 1467
 
0.1%
09/14/2017 1264
 
0.1%
01/24/2017 1235
 
0.1%
04-10-2018 1205
 
0.1%
01/25/2017 1148
 
0.1%
04-11-2018 1143
 
0.1%
Other values (2282) 1008357
98.4%

Most occurring characters

ValueCountFrequency (%)
0 2301429
22.5%
1 1892539
18.5%
2 1695246
16.5%
/ 1234084
12.0%
- 815936
 
8.0%
7 425544
 
4.2%
6 371280
 
3.6%
3 363862
 
3.5%
4 343633
 
3.4%
5 342655
 
3.3%
Other values (2) 463892
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8200080
80.0%
Other Punctuation 1234084
 
12.0%
Dash Punctuation 815936
 
8.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2301429
28.1%
1 1892539
23.1%
2 1695246
20.7%
7 425544
 
5.2%
6 371280
 
4.5%
3 363862
 
4.4%
4 343633
 
4.2%
5 342655
 
4.2%
8 273368
 
3.3%
9 190524
 
2.3%
Other Punctuation
ValueCountFrequency (%)
/ 1234084
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 815936
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10250100
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2301429
22.5%
1 1892539
18.5%
2 1695246
16.5%
/ 1234084
12.0%
- 815936
 
8.0%
7 425544
 
4.2%
6 371280
 
3.6%
3 363862
 
3.5%
4 343633
 
3.4%
5 342655
 
3.3%
Other values (2) 463892
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10250100
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2301429
22.5%
1 1892539
18.5%
2 1695246
16.5%
/ 1234084
12.0%
- 815936
 
8.0%
7 425544
 
4.2%
6 371280
 
3.6%
3 363862
 
3.5%
4 343633
 
3.4%
5 342655
 
3.3%
Other values (2) 463892
 
4.5%

Company Response to Consumer
Categorical

IMBALANCE

Distinct8
Distinct (%)< 0.1%
Missing3
Missing (%)< 0.1%
Memory size7.8 MiB
Closed with explanation
786749 
Closed with non-monetary relief
123665 
Closed with monetary relief
 
62358
Closed without relief
 
17868
Closed
 
17611
Other values (3)
 
16756

Length

Max length31
Median length23
Mean length23.751077
Min length6

Characters and Unicode

Total characters24345020
Distinct characters24
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowClosed with explanation
2nd rowClosed with explanation
3rd rowClosed with explanation
4th rowClosed with explanation
5th rowClosed with explanation

Common Values

Value | Count | Frequency (%)
Closed with explanation | 786749 | 76.8%
Closed with non-monetary relief | 123665 | 12.1%
Closed with monetary relief | 62358 | 6.1%
Closed without relief | 17868 | 1.7%
Closed | 17611 | 1.7%
In progress | 6423 | 0.6%
Closed with relief | 5304 | 0.5%
Untimely response | 5029 | 0.5%
(Missing) | 3 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
closed 1013555
31.5%
with 978076
30.4%
explanation 786749
24.5%
relief 209195
 
6.5%
non-monetary 123665
 
3.8%
monetary 62358
 
1.9%
without 17868
 
0.6%
in 6423
 
0.2%
progress 6423
 
0.2%
untimely 5029
 
0.2%

Most occurring characters

ValueCountFrequency (%)
e 2426227
10.0%
2189363
 
9.0%
o 2139312
 
8.8%
n 2023332
 
8.3%
l 2014528
 
8.3%
i 1996917
 
8.2%
t 1991613
 
8.2%
a 1759521
 
7.2%
s 1036459
 
4.3%
C 1013555
 
4.2%
Other values (14) 5754193
23.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 21006985
86.3%
Space Separator 2189363
 
9.0%
Uppercase Letter 1025007
 
4.2%
Dash Punctuation 123665
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 2426227
11.5%
o 2139312
10.2%
n 2023332
9.6%
l 2014528
9.6%
i 1996917
9.5%
t 1991613
9.5%
a 1759521
8.4%
s 1036459
 
4.9%
d 1013555
 
4.8%
w 995944
 
4.7%
Other values (9) 3609577
17.2%
Uppercase Letter
ValueCountFrequency (%)
C 1013555
98.9%
I 6423
 
0.6%
U 5029
 
0.5%
Space Separator
ValueCountFrequency (%)
2189363
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 123665
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 22031992
90.5%
Common 2313028
 
9.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 2426227
11.0%
o 2139312
9.7%
n 2023332
9.2%
l 2014528
9.1%
i 1996917
9.1%
t 1991613
9.0%
a 1759521
 
8.0%
s 1036459
 
4.7%
C 1013555
 
4.6%
d 1013555
 
4.6%
Other values (12) 4616973
21.0%
Common
ValueCountFrequency (%)
2189363
94.7%
- 123665
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24345020
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 2426227
10.0%
2189363
 
9.0%
o 2139312
 
8.8%
n 2023332
 
8.3%
l 2014528
 
8.3%
i 1996917
 
8.2%
t 1991613
 
8.2%
a 1759521
 
7.2%
s 1036459
 
4.3%
C 1013555
 
4.2%
Other values (14) 5754193
23.6%

Timely response?
Boolean

IMBALANCE

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1001.1 KiB
True
997239 
False
 
27771
Value | Count | Frequency (%)
True | 997239 | 97.3%
False | 27771 | 2.7%

Consumer disputed?
Boolean

MISSING

Distinct2
Distinct (%)< 0.1%
Missing256456
Missing (%)25.0%
Memory size2.0 MiB
False
620176 
True
148378 
(Missing)
256456 
Value | Count | Frequency (%)
False | 620176 | 60.5%
True | 148378 | 14.5%
(Missing) | 256456 | 25.0%

Complaint ID
Real number (ℝ)

Distinct | 1025010
Distinct (%) | 100.0%
Missing | 0
Missing (%) | 0.0%
Infinite | 0
Infinite (%) | 0.0%
Mean | 1646972.5
Minimum | 1
Maximum | 2893554
Zeros | 0
Zeros (%) | 0.0%
Negative | 0
Negative (%) | 0.0%
Memory size | 7.8 MiB

Quantile statistics

Minimum | 1
5-th percentile | 148604.9
Q1 | 901395.25
Median | 1747510.5
Q3 | 2464602.8
95-th percentile | 2814900.1
Maximum | 2893554
Range | 2893553
Interquartile range (IQR) | 1563207.5

Descriptive statistics

Standard deviation | 873386.73
Coefficient of variation (CV) | 0.53029831
Kurtosis | -1.2011559
Mean | 1646972.5
Median Absolute Deviation (MAD) | 756103.5
Skewness | -0.26976191
Sum | 1.6881633 × 10^12
Variance | 7.6280439 × 10^11
Monotonicity | Not monotonic
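
Several of the figures above are derived from others in the same listing. The following is a minimal sketch that recomputes them from the rounded values printed in this report, so agreement is only expected to the displayed precision. The variable names are illustrative and not part of the report.

```python
# Values copied from the Complaint ID statistics above (already rounded).
minimum, maximum = 1, 2893554
q1, q3 = 901395.25, 2464602.8
mean, std = 1646972.5, 873386.73
n_observations = 1025010

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n_observations     # Sum

print(f"Range    {value_range}")
print(f"IQR      {iqr:.1f}")
print(f"CV       {cv:.6f}")
print(f"Variance {variance:.4e}")
print(f"Sum      {total:.4e}")
```

Each recomputed value should match the corresponding reported statistic up to rounding.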
[Figure: Histogram with fixed size bins (bins=50)]
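
As an illustration of what fixed size bins means here, the sketch below splits the observed range into 50 equal-width intervals and counts values per interval. It is a plain-Python approximation and is not claimed to be the code that produced the figure; the function name `fixed_width_histogram` is hypothetical, and the input is limited to the twenty extreme Complaint ID values listed in this report.

```python
def fixed_width_histogram(values, bins=50):
    """Count values in `bins` equal-width intervals spanning [min, max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("all values are identical; bin width would be zero")
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum sits on the right edge, so clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts

# The ten smallest and ten largest Complaint ID values from this report.
smallest = [1, 5, 7, 16, 20, 22, 24, 26, 27, 36]
largest = [2893554, 2893480, 2893449, 2893438, 2893330,
           2893320, 2893311, 2893281, 2893246, 2893171]
edges, counts = fixed_width_histogram(smallest + largest, bins=50)
print(round(edges[1] - edges[0], 2), counts[0], counts[-1])
```

With the full column, each bin would cover roughly 57,871 consecutive ID values, since the range is 2893553.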

Common values
Value | Count | Frequency (%)
759217 | 1 | < 0.1%
165317 | 1 | < 0.1%
2278830 | 1 | < 0.1%
2095989 | 1 | < 0.1%
1397098 | 1 | < 0.1%
1535381 | 1 | < 0.1%
1284423 | 1 | < 0.1%
665918 | 1 | < 0.1%
380536 | 1 | < 0.1%
1186785 | 1 | < 0.1%
Other values (1025000) | 1025000 | > 99.9%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
5 | 1 | < 0.1%
7 | 1 | < 0.1%
16 | 1 | < 0.1%
20 | 1 | < 0.1%
22 | 1 | < 0.1%
24 | 1 | < 0.1%
26 | 1 | < 0.1%
27 | 1 | < 0.1%
36 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
2893554 | 1 | < 0.1%
2893480 | 1 | < 0.1%
2893449 | 1 | < 0.1%
2893438 | 1 | < 0.1%
2893330 | 1 | < 0.1%
2893320 | 1 | < 0.1%
2893311 | 1 | < 0.1%
2893281 | 1 | < 0.1%
2893246 | 1 | < 0.1%
2893171 | 1 | < 0.1%

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing | 1025010
Missing (%) | 100.0%
Memory size | 7.8 MiB

Interactions

[Figure: interactions plot]

Correlations

Variable | Complaint ID | Product | Sub-product | Company Public Response | State | Tags | Consumer consent provided? | Submitted via | Company Response to Consumer | Timely response? | Consumer disputed?
Complaint ID | 1.000 | 0.272 | 0.313 | 0.273 | 0.034 | 0.224 | 0.039 | 0.122 | 0.206 | 0.046 | 0.059
Product | 0.272 | 1.000 | 0.863 | 0.221 | 0.048 | 0.271 | 0.121 | 0.168 | 0.181 | 0.139 | 0.067
Sub-product | 0.313 | 0.863 | 1.000 | 0.237 | 0.034 | 0.320 | 0.119 | 0.285 | 0.175 | 0.151 | 0.080
Company Public Response | 0.273 | 0.221 | 0.237 | 1.000 | 0.029 | 0.143 | 0.033 | 0.040 | 0.109 | 0.087 | 0.071
State | 0.034 | 0.048 | 0.034 | 0.029 | 1.000 | 0.148 | 0.033 | 0.056 | 0.027 | 0.022 | 0.027
Tags | 0.224 | 0.271 | 0.320 | 0.143 | 0.148 | 1.000 | 0.058 | 0.144 | 0.074 | 0.029 | 0.004
Consumer consent provided? | 0.039 | 0.121 | 0.119 | 0.033 | 0.033 | 0.058 | 1.000 | 1.000 | 0.072 | 0.015 | 0.053
Submitted via | 0.122 | 0.168 | 0.285 | 0.040 | 0.056 | 0.144 | 1.000 | 1.000 | 0.064 | 0.024 | 0.068
Company Response to Consumer | 0.206 | 0.181 | 0.175 | 0.109 | 0.027 | 0.074 | 0.072 | 0.064 | 1.000 | 0.426 | 0.109
Timely response? | 0.046 | 0.139 | 0.151 | 0.087 | 0.022 | 0.029 | 0.015 | 0.024 | 0.426 | 1.000 | 0.032
Consumer disputed? | 0.059 | 0.067 | 0.080 | 0.071 | 0.027 | 0.004 | 0.053 | 0.068 | 0.109 | 0.032 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
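
To make the idea of nullity correlation concrete, here is a minimal sketch under stated assumptions: each column is turned into a 0/1 "value present" indicator, and the Pearson correlation is computed between indicators. The helper names (`pearson`, `nullity_correlation`) and the four toy rows are hypothetical and only loosely modeled on the sample records; this is not presented as the code behind the heatmap.

```python
from itertools import combinations
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences, or None if either is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return None  # a fully present or fully missing column has no variation
    return cov / sqrt(var_x * var_y)

def nullity_correlation(rows, columns):
    """Correlate presence indicators (1 = value present, 0 = missing) for each column pair."""
    present = {c: [0 if row.get(c) is None else 1 for row in rows] for c in columns}
    return {(a, b): pearson(present[a], present[b]) for a, b in combinations(columns, 2)}

# Toy rows (hypothetical): Sub-issue is filled exactly when Sub-product is not.
rows = [
    {"Sub-product": "Other mortgage", "Sub-issue": None, "Unnamed: 18": None},
    {"Sub-product": None, "Sub-issue": "Account status", "Unnamed: 18": None},
    {"Sub-product": "Vehicle loan", "Sub-issue": None, "Unnamed: 18": None},
    {"Sub-product": None, "Sub-issue": "Debt is not mine", "Unnamed: 18": None},
]
result = nullity_correlation(rows, ["Sub-product", "Sub-issue", "Unnamed: 18"])
print(result[("Sub-product", "Sub-issue")])
```

A value near -1 means one column tends to be filled when the other is empty, and a value near 1 means they tend to be filled or empty together. Columns with no variation in missingness, such as the entirely empty Unnamed: 18, yield no coefficient in this sketch.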

Sample

First rows
(index) | Date received | Product | Sub-product | Issue | Sub-issue | Consumer Complaint | Company Public Response | Company | State | ZIP code | Tags | Consumer consent provided? | Submitted via | Date Sent to Company | Company Response to Consumer | Timely response? | Consumer disputed? | Complaint ID | Unnamed: 18
0 | 2014-03-12 | Mortgage | Other mortgage | Loan modification,collection,foreclosure | NaN | NaN | NaN | M&T BANK CORPORATION | MI | 48382 | NaN | NaN | Referral | 03/17/2014 | Closed with explanation | Yes | No | 759217 | NaN
1 | 2016-10-01 | Credit reporting | NaN | Incorrect information on credit report | Account status | I have outdated information on my credit report that I have previously disputed that has yet to be removed this information is more then seven years old and does not meet credit reporting requirements | Company has responded to the consumer and the CFPB and chooses not to provide a public response | TRANSUNION INTERMEDIATE HOLDINGS, INC. | AL | 352XX | NaN | Consent provided | Web | 10-05-2016 | Closed with explanation | Yes | No | 2141773 | NaN
2 | 2016-10-17 | Consumer Loan | Vehicle loan | Managing the loan or lease | NaN | I purchased a new car on XXXX XXXX. The car dealer called Citizens Bank to get a 10 day payoff on my loan, good till XXXX XXXX. The dealer sent the check the next day. When I balanced my checkbook on XXXX XXXX. I noticed that Citizens bank had taken the automatic payment out of my checking account at XXXX XXXX XXXX Bank. I called Citizens and they stated that they did not close the loan until XXXX XXXX. ( stating that they did not receive the check until XXXX. XXXX. ). I told them that I did not believe that the check took that long to arrive. XXXX told me a check was issued to me for the amount overpaid, they deducted additional interest. Today ( XXXX XXXX, ) I called Citizens Bank again and talked to a supervisor named XXXX, because on XXXX XXXX. I received a letter that the loan had been paid in full ( dated XXXX, XXXX ) but no refund check was included. XXXX stated that they hold any over payment for 10 business days after the loan was satisfied and that my check would be mailed out on Wed. the XX/XX/XXXX.. I questioned her about the delay in posting the dealer payment and she first stated that sometimes it takes 3 or 4 business days to post, then she said they did not receive the check till XXXX XXXX I again told her that I did not believe this and asked where is my money. She then stated that they hold the over payment for 10 business days. I asked her why, and she simply said that is their policy. I asked her if I would receive interest on my money and she stated no. I believe that Citizens bank is deliberately delaying the posting of payment and the return of consumer 's money to make additional interest for the bank. If this is not illegal it should be, it does hurt the consumer and is not ethical. My amount of money lost is minimal but if they are doing this on thousands of car loans a month, then the additional interest earned for them could be staggering. I still have another car loan from Citizens Bank and I am afraid when I trade that car in another year I will run into the same problem again. | NaN | CITIZENS FINANCIAL GROUP, INC. | PA | 177XX | Older American | Consent provided | Web | 10/20/2016 | Closed with explanation | Yes | No | 2163100 | NaN
3 | 2014-06-08 | Credit card | NaN | Bankruptcy | NaN | NaN | NaN | AMERICAN EXPRESS COMPANY | ID | 83854 | Older American | NaN | Web | 06-10-2014 | Closed with explanation | Yes | Yes | 885638 | NaN
4 | 2014-09-13 | Debt collection | Credit card | Communication tactics | Frequent or repeated calls | NaN | NaN | CITIBANK, N.A. | VA | 23233 | NaN | NaN | Web | 09/13/2014 | Closed with explanation | Yes | Yes | 1027760 | NaN
5 | 2013-11-13 | Mortgage | Conventional adjustable mortgage (ARM) | Loan servicing, payments, escrow account | NaN | NaN | NaN | U.S. BANCORP | MN | 48322 | NaN | NaN | Phone | 11/20/2013 | Closed with monetary relief | Yes | No | 596562 | NaN
6 | 2015-06-16 | Debt collection | Medical | Improper contact or sharing of info | Contacted employer after asked not to | NaN | Company believes it acted appropriately as authorized by contract or law | California Accounts Service | CA | 92111 | NaN | Consent not provided | Web | 06/19/2015 | Closed with explanation | Yes | No | 1422680 | NaN
7 | 2015-06-15 | Credit reporting | NaN | Credit reporting company's investigation | Inadequate help over the phone | An account on my credit report has a mistaken date. I mailed in a debt validation letter to allow XXXX to correct the information. I received a letter in the mail, stating that Experian received my correspondence and found it to be " suspicious '' and that " I did n't write it ''. Experian 's letter is worded to imply that I am incapable of writing my own letter. I was deeply offended by this implication. \nI called Experian to figure out why my letter was so suspicious. I spoke to a representative who was incredibly unhelpful, She did not effectively answer any questions I asked of her, and she kept ignoring what I was saying regarding the offensive letter and my dispute process. I feel the representative did what she wanted to do, and I am not satisfied. It is STILL not clear to me why I received this letter. I typed this letter, I signed this letter, and I paid to mail this letter, yet Experian willfully disregarded my lawful request. \nI am disgusted with this entire situation, and I would like for my dispute to be handled appropriately, and I would like for an Experian representative to contact me and give me a real explanation for this letter. | Company chooses not to provide a public response | Experian Information Solutions Inc. | VA | 224XX | NaN | Consent provided | Web | 06/15/2015 | Closed with explanation | Yes | No | 1420702 | NaN
8 | 2015-11-13 | Mortgage | Other mortgage | Loan modification,collection,foreclosure | NaN | NaN | Company believes it acted appropriately as authorized by contract or law | Aldridge Pite, LLP | CA | 93101 | NaN | NaN | Referral | 12-10-2015 | Closed with explanation | Yes | Yes | 1654890 | NaN
9 | 2014-10-21 | Mortgage | Conventional fixed mortgage | Loan modification,collection,foreclosure | NaN | NaN | NaN | OCWEN LOAN SERVICING LLC | FL | 32714 | Older American | NaN | Web | 10/21/2014 | Closed with explanation | Yes | No | 1079567 | NaN

Last rows
(index) | Date received | Product | Sub-product | Issue | Sub-issue | Consumer Complaint | Company Public Response | Company | State | ZIP code | Tags | Consumer consent provided? | Submitted via | Date Sent to Company | Company Response to Consumer | Timely response? | Consumer disputed? | Complaint ID | Unnamed: 18
1025000 | 2015-09-17 | Credit card | NaN | Rewards | NaN | Barclay closed my Barclay XXXX MasterCard account XX/XX/XXXX, and sent a letter indicating the following : " We have recently conducted a review of our accounts. Following this review, we regret to advise you that we are unable to maintain an account with you because of your history of account usage. Your account has been closed in accordance with the terms and conditions of the Cardmember Agreement that governs your account. '' This sudden closure of my account came as a surprise to me and I am very upset as I had XXXX Barclay Arrival points ( worth approximately {$2900.00} in travel expenses ) in this account that I can no longer access because the account is now closed. It is truly disheartening to have worked so hard to earn those points, only to lose them in such a shocking and abrupt manner. I have been a Barclay 's credit card customer XX/XX/XXXX, and I am disappointed with the manner in which they have closed my account. My expectation is that Barclay should have at least notified me about what I had been doing that was unsatisfactory, so that I could have taken the necessary actions to avoid being closed. Instead they closed my account without any warning. | Company chooses not to provide a public response | BARCLAYS BANK DELAWARE | IN | 461XX | NaN | Consent provided | Web | 09/17/2015 | Closed with monetary relief | Yes | No | 1568426 | NaN
1025001 | 2014-02-11 | Credit card | NaN | Identity theft / Fraud / Embezzlement | NaN | NaN | NaN | CAPITAL ONE FINANCIAL CORPORATION | WV | 25504 | Older American | NaN | Web | 02-11-2014 | Closed with monetary relief | Yes | No | 710260 | NaN
1025002 | 2016-11-09 | Debt collection | Medical | False statements or representation | Attempted to collect wrong amount | Our son was taken to XXXX XXXX XXXX XXXX XXXX XXXX on XXXX XXXX, 2012 as an ER visit. We had insurance through XXXX XXXX at the time and the hospital failed to submit the claim to our insurance company, and has been asking us to pay out of pocket for the services even though the services were covered under our insurance. XXXX says they submitted a claim for {$1200.00} to the insurance company, and when I asked XXXX to explain to me why it was denied, they could n't provide a reason other than simply saying that it " was n't paid '' ; I believe it " was n't paid '' because it was never sent out not to mention that XXXX has no record of this claim amount for XXXX 2012 for our son. \n\nMy husband I followed up numerous times with the hospital asking them to resubmit the appropriate paperwork to the insurance and they never sent the claim to the insurance, even though they reassured us they had. We tried everything possible to get the XXXX parties connected and appropriate documents sent over, but we are the middle men. I spoke with XXXX at XXXX today, and she confirmed that the only claim they received for our son for services in XXXX 2012 was a claim totaling {$670.00}, of which we were responsible for {$250.00} ( {$220.00} deductible amt + {$30.00} co-pay ). XXXX does not show any claim for {$1200.00} ; therefore it was never sent/received. | Company believes complaint caused principally by actions of third party outside the control or direction of the company | R & B Corporation of Virginia | NJ | 077XX | NaN | Consent provided | Web | 11-09-2016 | Closed with explanation | Yes | No | 2201681 | NaN
1025003 | 2016-01-22 | Bank account or service | Checking account | Deposits and withdrawals | NaN | On XXXX/XXXX/13, without my authorization, Bank of America withdrew {$29000.00} from my personal account to charge off my company credit card ( XXXX - Bank of America ). My corporation had a separate bank account and a Business card account with Bank of America not linked with my personal account. | Company chooses not to provide a public response | BANK OF AMERICA, NATIONAL ASSOCIATION | FL | 347XX | NaN | Consent provided | Web | 01/22/2016 | Closed with monetary relief | Yes | No | 1753439 | NaN
1025004 | 2017-01-26 | Debt collection | Other (i.e. phone, health club, etc.) | Cont'd attempts collect debt not owed | Debt is not mine | NaN | NaN | North Shore Agency, LLC | OR | 97355 | NaN | NaN | Referral | 01/31/2017 | Closed with explanation | Yes | No | 2313821 | NaN
1025005 | 2017-04-10 | Debt collection | Credit card | Cont'd attempts collect debt not owed | Debt is not mine | NaN | Company has responded to the consumer and the CFPB and chooses not to provide a public response | PENTAGON FEDERAL CREDIT UNION | TX | 77802 | NaN | NaN | Referral | 04-11-2017 | Closed with explanation | Yes | No | 2428130 | NaN
1025006 | 2017-02-07 | Debt collection | Other (i.e. phone, health club, etc.) | Cont'd attempts collect debt not owed | Debt is not mine | I had an account with XXXX in XX/XX/XXXX this was previously disputed for XXXX $ $ because at & t sold their towers in the area of my employer so my father and I whom both work here could not receive any phone calls while in our job, XXXX agreed and deleted it from our credit report. Now they 're saying I owe a combined about of XXXX sounds like XXXX is trying to combine the XXXX XXXX my father and I owed for termination onto my report only. | Company believes it acted appropriately as authorized by contract or law | ERC | NY | 115XX | Servicemember | Consent provided | Web | 02-07-2017 | Closed | Yes | No | 2331270 | NaN
1025007 | 2017-01-04 | Mortgage | Conventional fixed mortgage | Application, originator, mortgage broker | NaN | I was contacted on XX/XX/XXXX email by XXXX from Caliber Home Loans to refinance my current loan with them. I replied on XX/XX/XXXX he gave me an acceptable estimate for refinancing my condo which was already mortgaged with Caliber. XXXX then proceeded to try to get me to refinance my primary home. He presented another favorable quote so I proceeded. We started the process and got both appraisals done and I provided all of the requested documents. The appraisals came back lower than the estimates so XXXX reworked the estimates and I approved and we moved on. This all happened before XXXX. I got a call from my first loan processor XXXX on XX/XX/XXXX herself and stating she was beginning the next phase of my loans. I have never heard another word from XXXX. I began trying to get a status of my loans on XX/XX/XXXX to allow extra time for the holidays. I called and emailed both XX/XX/XXXX and XX/XX/XXXX. Neither of them EVER replied or returned my phone calls. I tried emailing and calling multiple times every day. I called their main number but they were useless. I tried logging onto my account to check the status of my loans but I could n't even access my account because of some IT problem they were having. I finally got someone on the XXXX number to give me someone else 's number. So I started calling XXXX who was supposedly XXXX boss but he never called me back either. On XX/XX/XXXX I got a call back from XXXX and he has been the most responsive but he has gone silent since the holidays. I was then assigned a new loan processor XX/XX/XXXX who basically started all over. I was told my new loan details after the appraisals came back lower were never entered into their system even though I had signed the new loans before XXXX. XXXX sent new estimates and my closing costs were higher even though I was borrowing less. XXXX said it was because my properties were entered as something other than condos and the condo closing costs are higher. He also refused to reimburse me for the appraisals. He said I would get " credit for paying '' them at closing but I would not get reimbursed as XXXX had told me. He also is not honoring the lender credit I was given. I had locked in my interest rates which expired on XX/XX/2017. Supposedly Caliber extended the rate lock until XX/XX/XXXX but I have only received an email about the rate lock. I have not seen any official documentation. Late last week XXXX said there is a problem with the HOA on my primary property because of pending litigation but that it could still be handled and I could close on both properties if the right departments from Caliber was involved. I have not heard a word from XXXX since and the only communication I received at all was from XXXX 's boss yesterday in an informal email about my rate lock extension. \nI know this is a lengthy story but here are my complaints : 1. Deceitful marketing practices. I am confident XXXX intentionally entered my properties into their system as something other than condos to make the deals more attractive in order to get me to refinance. The reason I am confident is because they already hold the loan on one of my properties. \n2. Not honoring the agreement of reimbursing me for the appraisals and not giving me the lender credit as promised. \n3. The total and complete lack of professionalism in their lack of communication with their customer, me. This is the absolute worst experience I have ever had with any institution wanting my business for anything. | NaN | Caliber Home Loans, Inc. | FL | 336XX | NaN | Consent provided | Web | 01-04-2017 | Closed with explanation | Yes | No | 2274241 | NaN
1025008 | 2015-09-28 | Debt collection | Non-federal student loan | Disclosure verification of debt | Not given enough info to verify debt | NaN | Company chooses not to provide a public response | Progressive Financial Services, Inc. | OH | 44017 | NaN | NaN | Postal mail | 09/30/2015 | Closed with explanation | Yes | No | 1582525 | NaN
1025009 | 2016-08-19 | Debt collection | Payday loan | Cont'd attempts collect debt not owed | Debt is not mine | I had a debit that was included in my chapter XXXX BK, almost two years letter this item showed on my credit reports under collection status for Midwest Recovery Systems. This dropped my credit score XXXX points. I called them and they said their client had n't informed them that it was included in BK, but the damage had already been done. It took them 30 days to remove this incorrectly put item on my credit reports. Its still showing up on my XXXX. I thought this was against the law for them to do that since I am protected by BK laws. They should be fined for this. I should sue. | Company believes it acted appropriately as authorized by contract or law | Midwest Recovery Systems | FL | 336XX | NaN | Consent provided | Web | 08/19/2016 | Closed with explanation | Yes | Yes | 2073214 | NaN
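
The sample rows show Date Sent to Company written in two layouts, for example 03/17/2014 and 10-05-2016, and the overview lists that column as high-cardinality text rather than a date. The sketch below is one possible way to normalize such values, assuming both layouts are month-first (an inference from the sample rows only, where every sent date falls on or after the received date under that reading). The function name `parse_sent_date` and the format tuple are hypothetical.

```python
from datetime import date, datetime

# Assumed layouts, inferred from the sample rows only (month first in both).
ASSUMED_FORMATS = ("%m/%d/%Y", "%m-%d-%Y")

def parse_sent_date(text: str) -> date:
    """Try each assumed layout in turn and return the first successful parse."""
    for fmt in ASSUMED_FORMATS:
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date layout: {text!r}")

# Values taken from the sample rows above.
parsed = [parse_sent_date(s) for s in ("03/17/2014", "10-05-2016", "12-10-2015")]
print(parsed)
```

Raising on unknown layouts, rather than guessing, keeps any third format in the full column visible instead of silently misparsed.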